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Abstract 

We address here the treatraent of 
metonymic expressions from a knowl- 
edge representation perspective, that is, 
in the context of a text understanding 
system which aims to build a concep- 
tual representation from texts according 
to a domain model expressed in a knowl- 



edge representation iormalism. We fo- 
cus m this paper on the part ot the se- 
mantic analyser which deals with seman- 
tic composition. We explain how we use 
the domain model to handle metonymy 
dynamically, and more generally, to un- 
derlie semantic composition, using the 
knowledge descriptions attached to each 
concept of our ontology as a kind of 



concept-level, multiple-role qualia struc- 



ture. We rely for this on a heuristic 
path search algorithm that exploits the 
graphic aspects of the conceptual graphs 
formalism. The methods described have 
been implemented and applied on French 
texts in the medical domain. 



1 Introduction 

Under the compositional assumption, semantic 
analysis relies on the combination of the mean- 
ing representations of parts to build the meaning 
representations of a whole. However, this com- 
position often needs to call on implicit knowledge 
which helps to link the two meaning representa- 
tions. This is the case, for instance, in metonymic 
expressions, where a word is used to express a 
notion closely related to its central meaning. A 
well-known stream of work addressing this phe- 



nomenon is the Generative Lexicon theory (Puste- 



jovsky, 1991 ). At the heart of this theory is a lex- 



ical semantic representation called "qualia struc- 
ture". Metonymies are considered to correspond 
to changes in the semantic types of the words in- 
volved, and the qualia structure provides the basis 
for performing type coercion in a generative way. 

We address here the treatment of metonymic 
expressions from a knowledge representation per- 
spective, in the context of the Menelas medi- 



cal text understanding system (Zweigenbaum et 



al., 1995). One of the goals of the overall system 



is to assign standardised, medical nomenclature 
codes to the input texts (patient discharge sum- 
maries). Semantic analysis starts from a syntac- 
tic representation of each sentence and produces a 
conceptual representation. It is then used by sev- 
eral language- independent, knowledge-based com- 
ponents to perform inferences (pragmatic cnrich- 



ment) an d then code assignment (Delamarre et 



al., 1995 ). Therefore, the conceptual represen- 
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tation output by the semantic analyser must be 
normalised: it must conform to a knowledge repre- 
sentation canon in which the target nomenclature 
codes can be mapped. The specification of this 
canon relies on the description of a rich model of 
the domain in a knowledge representation formal- 
ism, here Conceptual Graphs (CG) ( |Sowa, 1984| ). 

We focus in this paper on the part of the se- 
mantic analyser that deals with semantic com- 
position. The conceptual representation built 
must be abstracted from initial linguistic varia- 
tion, metonymy being a typical problem to be ad- 
dressed. We explain how we use the domain model 
to handle metonymy, and more generally, to un- 
derlie semantic composition, using the knowledge 
descriptions attached to each concept of our ontol- 
ogy as a kind of concept-level, multiple-role qualia 
structure. The methods described have been im- 
plemented and applied to French texts. 

We first recall the problem addressed (sec- 
tion H). Then, the proposed method is described 
(section^) and illustrated on an example. We give 
some information on the implementation and the 



results of the analyser (section Q), and discuss the 
relative merits of the method (section ^ . 

2 Metonymy and type coercion 



A classical example of metonymy (Pustejovsky. 
p. 428ff) is 



1991 



(1) John began a novel. 

where predicate 'began' expects an event as its 
second argument, so that some way must be found 
to relate the object 'novel' to an event such as 'to 
read a novel' or 'to write a novel'. In our do- 
main (coronary diseases), one often finds expres- 
sions such as 

(2) una angioplastie du segment II (an angio- 
plasty of segment II) 

(3) une angioplastie d'une artere coronaire (an 
angioplasty of a coronary artery) 

(4) I'angioplastie de Monsieur X (the angioplasty 
ofMrX) 

(5) une angioplastie de la stenose (an angioplasty 
of the stenosis) 

where 'angioplasty' is an action performed on a 
segment of an artery to enlarge its diameter, while 
'stenosis' is the state of an artery which has a re- 
duced diameter. These four phrases involve the 
object (or "theme") of action 'angioplasty', i.e., 
what the angioplasty operates upon. If one con- 
siders that this theme must be a physical ob- 



ject, then examples (2) -(4) conform to the selec- 
tional restrictions of 'angioplasty', while (5) vi- 
olates them. The mechanism of type coercion 



(Pustejovsky, 1991) consists in converting a word 
type into another so that semantic composition 
can work properly. K5)| is then handled as a 
metonymy, where the stenosis and the stenosed 
object enter a state/thing alternation: 'stenosis' 
is turned into an 'object'. 

However, it appears that this phenomenon is de- 
pendent on the underlying types (or "sorts") un- 
der consideration. For instance in our ontology, 
'segment', 'artery', 'stenosis' and 'human' have 
four different types, and are not comparable by 
the IS-A relation, e.g. nothing can be both a seg- 
ment and an ar tery.p| This is a volu ntary, method- 
ological choice ( Bouaud et al., 1995 ), motivated by 
the fact that these objects give rise to different in- 
ferences and must not be confused by the reason- 
ing component. Additionally, in the target nor- 
malised conceptual representation, what consti- 
tutes the specific theme (in our conceptual model, 

^Segment, in our ontology, corresponds to a portion 
of space, not of matter. 



the purported_obj) of action 'angioplasty' must 
be precisely defined. In the context of our appli- 
cation, 'angioplasty' acts on an artery_segment, 
a physical object corresponding to a part of an 
artery, which happens not to be comparable to 
any of the four preceding themes of 'angioplasty'.^ 
Therefore, all four examples (2) -(5) must be con- 
sidered as metonymies. 



To handle metonymy. Pass (1988) proposes a 
method based on a list of alternations imple- 
mented as specific metonymy rules: Part_for- 
-Whole, Container_for_Contents, etc. gowa (19*9^ ) 
considers metonymies around the term "Prix Gon- 
court", originally introduced by Kayser (198^ ): 
this term undergoes different meaning shifts in 
each of seven example sentences, ranging from the 
author who won the prize to the amount of money 
received. Sowa discusses how background knowl- 
edge could help to process these metonymies, 
based on a knowledge description of what "Prix 
Goncourt" involves. 

In our system, the target conceptual representa- 
tion is defined by a domain model expressed with 
CGs. This same model constitutes the resource 
which enables the analyser to handle metonymies. 
We explain below how results similar to Puste- 
jovsky's type coercion may be obtained with a 
method based on this domain model instead of 
a qualia structure. 

3 Method 
3.1 Rationale 

The input to the semantic analyser is the syntactic 
representation of a sentence produced b y a pre- 
vious large coverage syntactic analyser ( Berard 
Dugourd et al., 1989 ). This representation con- 
nects words, or predicates, with grammatical rela- 
tions such as subject, object, oblique object, mod- 
ifier, etc. The output of the semantic analyser is 
a conceptual graph on which pragmatic inferences 
are performed to enrich the representation. 

In the semantic lexicon, each word points to one 
or more conceptual representations. The gram- 
matical link between two words in a sentence ex- 
presses a conceptual link between their two associ- 
ated conceptual counterparts. The task of the se- 
mantic analyser is to identify this conceptual link. 
Rather than including the knowledge needed for 
this task in the semantic lexicon, or in a specific 
rule base, the program will examine the domain 



Notice, though, that these types are strongly 
linked (by relations other than is-a) through the 
knowledge base models. The semantic analyser pre- 
cisely recovers these links thanks to the mechanism 
presented in this paper. 



knowledge to resolve the link. The method relies 
on a heuristic path search algorithm that exploits 
the graphic aspects of the conceptual graphs for- 
malism. 

3.2 Domain knowledge 

The main domain knowledge elements consist of 
the domain ontology (Fig. |l|) which is a subsump- 
tion hierarchy of concept types (henceforth simply 
'types') and of relation types, and of a set of ref- 
erence models attached to the main types. 

The reference model of a type represents knowl- 
edge about this type as a conceptual graph 
(Fig. H). Basically, a conceptual graph is a bi- 
partite graph with concept nodes (or concepts) 
labeled with a type plus an optional referent, and 
relation nodes labeled with relation types ( |Chein 
and Mugnier, 1992). A model of a given type has 
an identified head concept with the same type, 
and the network of its related concepts represents 
its associated knowledge. Since types are organ- 
ised in an IS-A hierarchy, this knowledge is also 
inherited. 



ical link will therefore be computed dynamically 
by the semantic analyser using its context: the 
linked predicates and domain knowledge. How- 
ever, each grammatical relation may have concep- 
tual preferences for types or for conceptual rela- 
tions. These preferences are associated with the 
grammatical relation. Our grammatical relations 
include oblique complements, so that prepositions 
in our semantic lexicon are expressed under this 
second paradigm (Fig. 



Entry angioplastic_f is [Angioplasty: *x]. 
Entry stcnosc_f is [Stenosis: *x]. 
Entry scgmcnt_II_f is 
[Segment JI:*x]- 

(relative_to) — > [Artery] 
{spatial_role) ■* — [Spatial_Object] 
— *(zonc_of ) — > [Artery .Segment] . 

Grammatical_rel de_f :prefers 
purported_obj involved_obj pat 
motivated_by before_state after_state rel. 



Figure 3: Some semantic lexicon entries for pred- 
icates and a grammatical relation. 



Model Angioplasty (*x) is 
[Angioplasty: *x]- 

(pat ) — ^ [Human_Being: *pat] — » (cultural_function) — * 

[MedieaLSubf unction] — *(cultural_role) — > [Patient] 
(agt ) — > [Human_Being: *doe] — > (cultural_function) — * 

[MedicaLSubfunction] — *(eultural_role) — > [Physician] 
(motive) ^ [State.Of.Mind]- 

(state_of ) — * [Human_Being:*doc] 
(content)— ^[Stenosis:*stl] % 
(purported_ob j ) — > [Artery .Segment : *as] - 
(involves) ■( — [Stenosis: *st 1] 
(involves) < — [InternalJ3tatc: *is3] 
(part)< — [HumanJ3eing:*pat] % 
(descriptive^oal)^ — [Internal-State: *is3]- 

Figure 2: An extract of reference model for type 
Angioplasty. 

3.3 Semantic lexicon 

The semantic analyser relies on a two-tier seman- 
tic lexicon: one for predicates, the other for gram- 
matical relations. Predicates map to conceptual 
graphs; most of them are reduced to one concept, 
since most of the words in the lexicon are techni- 
cal terms for which a type exists. Figure H reports 
some lexical entries. 

It is difficult to map grammatical relations 
to static, predefined conceptual representations, 
since their meaning in the domain depends on 
their context of use, and mostly on the predi- 
cates they link. Besides, one cannot think of 
envisioning all the possible uses of such a rela- 
tion, partly because of the use of metonymy. The 
conceptual representation of an actual grammat- 



3.4 Algorithm 

Given an input triple predicate, grammatical rela- 
tion, predicate (Pi; Gr; P2), the semantic analyser 
first replaces the two predicates with their seman- 
tic entries — two conceptual graphs. It then en- 
deavours to link them, that is, to find a concept- 
level relation between their two head concepts Ci 
and C2 that, first, is compatible with the semantic 
preferences of grammatical relation Gr, and, sec- 
ond, conforms to the representational canon made 
of the reference models. 

3.4.1 Design principle. 

The basic idea is to project the two head con- 
cepts onto the domain knowledge and find a 
plausible concept-level relation between the two. 
We implement this by heuristic graph traversal 
through the reference models and the type hierar- 
chy, looking for a chain made of concepts and con- 
ceptual relations {i.e. a linear conceptual graph), 
which could link concepts of the same types as Ci 
and C2 and at the same time would satisfy the 
conceptual preferences of Gr. Semantic analysis 
then consists in solving recursively every gram- 
matical link starting from the sentence head pred- 
icate and then joining the obtained conceptual 
chains to build the conceptual representation of 
the whole sentence. We focus here only on the 
link resolution algorithm. 



State 

I 

Stenosis 



Entity 

Intentional_Object " SpatiaI_Role_function 
Intentional_Change Segment_II 
Angioplasty 



Physical_object 
Anatomical Element 



Vessel 

I 

Artery 



Artery _S egment 
Lad_Segment_Il 



Figure 1: An extract of the domain ontology. 



3.4.2 Chain production methods. 

We consider that each predicate Pj is associated 
with the head concept Ci of a model Mi. Let Ti 
be the type of Ci. We also assume a partial order 
on types. We focus here only on the strategy for 
producing the set of all possible chains between C\ 
and C2. We can use three methods of increasing 
complexity to find chains to link C\ and C2: 



As the type hierarchy is, in our system, a tree 
( Bouaud et al., 1995 ), the models for a predicate 
are strictly ordered. Considering two grammati- 
cally linked predicates, the product of their mod- 
els constitutes as many model pairs that can be 
potentially used to look for possible chains. Such 
pairs are structured by a partial order based on 
the generality rank of their members.^ 



1. Concept fusion: the two concepts may be re- 3.4.4 Heuristic chain selection. 



dundant. 

If Ti < T2 or Ti>T2, then Ci and C2 could 
be merged, and an empty chain is returned. 

2. Concept inclusion: a concept may be "in- 
cluded" in the other's model. 

(a) For every concept C" of type T' in Mi 
such that T' > T2, every path between 
Ci and C in Mi is a returned chain. 

(b) For every concept C" of type T' in M2 
such that T' >Ti, every path in M2 be- 
tween C" and C2 is a returned chain. 

3. Model join: two arbitrary concepts in the two 
models could be joined. 

For every pair of concepts (C^jCj) where C- 
of type T( is in M^, and such that T{ < 
or T{ > T2, all the paths Pathsi between Ci 
and C[ in Mi and Paths2 between and 
C2 in M2 are produced. Then, for every pair 
{pijP2) in Pathsi x Paths2, the chain made 
of the two paths where last{pi) is joined to 
first{p2) is returned. 

At this point, we are provided with all chains ex- 
tracted from the pair of models (Mi, M2). 

3.4.3 Model identification. 

The models that associate knowledge to a given 
predicate P can be ranked according to their level 
of generality. The most specific model is the pred- 
icate definition in the semantic lexicon. The next 
one is the reference model associated with the type 
T of the head concept of the definition. Then, the 
following models are the reference models inher- 
ited along the ontology through supertypes of T. 



At this stage, we are provided with all the pos- 
sibles chains between Pi and P2 extracted from 
their models. The remaining problem is to choose 
the most appropriate chain to substitute for Gr. 
After some experimentation, we chose the follow- 
ing scheme. The best chain is selected accord- 
ing to five heuristic criteria: (1) satisfiability of 
Gr preferences; (2) most specific model pair, i.e., 
the use of most specific knowledge associated with 
words is prefered; (3) simplest chain production 
method (see 3.4. 2|) ; (4) most specific or high- 
est priority of Gr preferences; (5) shorter chain 
length. When multiple chains remain in competi- 
tion, one is selected randomly. 

To reduce search, the link resolution strategy 
does not consider all possible chains, and imple- 
ments the first two criteria directly in the chain 
production step. Chains that violate Gr prefer- 
ences are discarded, and model pairs are explored 
starting from the most specific pair. 

3.5 An example 

Let us illustrate the resolution on example |(2)| 
(an angioplasty of segment II). The input triple 
is (angioplastie_f;de_f;segmentJI_f). The corre- 
sponding types. Angioplasty and Segment_ll, 
are not compatible and the "fusion" method fails. 
The "inclusion" method also fails since no model 
for angioplastie_f includes a concept compatible 
with Segment_ll, and no model for segnient_ii_f 
includes a concept compatible with Angioplasty. 



•^A model pair (mi, 7712) is more spe- 
cific than (m'i,m2) if maxjrank{mi,m2) is less than 
maxjrank{m'i,m'2), or if equal, minjrank{mi,m2) is 
less than minjfank(m'-i,m'2). 



However, with the "join" method, the algorithm 
identifies 6063 possible chains that satisfy the 
preferences attached to preposition de_f (Fig. ||). 

The selected chairuuses the reference model of 
Angioplasty (Fig J3) and the definition graph for 

segment_II_f (Fig. Hj which are connected on con- 
cept Artery_Segment. The resulting conceptual 
representation joins the two corresponding paths: 

[Angioplasty] — ^ (purported_obj ) — > [Artcry_Scgmcnt] . 

[Artery .Segment] < — (zone_of ) < — [SpatiaLObject] 

— > (spatiaLrole) ^ [SegmentJI] . 

into 

[Angioplasty] — > (purported_obj ) — * [Artery_Scgment] 
«—(zone_of)<— [SpatiaLObject] 

— > (spatiaLrole) — ^ [SegmentJI] . 

This representation reflects the fact that in the 
context of an 'angioplasty', 'segment IT is consid- 
ered from the point of view of the physical artery 
segment the angioplasty is to act upon (instead of 
the spatial notion Segment_II expresses). 

4 Implementation and results 

This analyser has been implemented on top of a 
conceptual graph processing package embedded in 
Common Lisp. In the current state, the ontol- 
ogy contains about 1,800 types and 300 relation 
types; over 500 types have their own reference 
model; the lexicon defines over 1,000 predicates 
and about 150 grammatical relations and prepo- 
sitions. The analyser correctly handles typical 
expressions found in our texts, including exam- 
ples |(2)| - |(5)| (see table |l|) . The complete process- 
ing chain has been tested on a set of 37 discharge 



summaries (393 sentences, 5,715 words) (Zweigen 



baum et al., 1995). This corpus included devel- 



opment texts, so the results are somewhat opti- 
mistic; on the other hand, the system is in an 
incomplete state of development. The test con- 
sisted in code assignment and answering a fixed 
questionnaire, the gold standard being given by 
health care professionals. Overall recall and pre- 
cision were measured at 48 % and 63 % on the 
coding task, and 66 % and 77 % on the question- 
naire task. 

No evaluation has been performed on more ba- 
sic components of the system; we can however 
provide statistics drawn from the global test for 
the semantic analyser. For 274 sentences received, 
the link resolution procedure was called on 8,749 
grammatical links and explored 247,877 chains, 
with an average of 28 chains per call and 904 per 
sentence. The number of paths found depends 
heavily on the richness of the models used, which 
varies with the types involved. For instance, the 
model for type angioplasty (involved in table ^) 
is central in the domain. It is the most complex 



in the knowledge base and contains 54 concepts 
and 78 relations, which accounts for the greater 
number of paths found in these examples. 

However, inadequate expansions are sometimes 
made due to lack of models, or to their complex- 
ity, which makes the heuristic principles not se- 
lective enough. Such limitations also stem from 
a lack of "actual" semantic knowledge. The se- 
mantic analyser goes directly from grammatical 
relations to conceptual relations without any in- 
termediate semantic representation. Useful infor- 
mation, such as the argumental or thematic struc- 
ture of predicates {e.g., Mel'cuk et al. (1995| ), 
Pugeault et al. (1994)), could probably overcome 
some of its shortcomings. 

5 Discussion 

One could compare this approach to a concept- 
based, multi-role qualia structure. The semantic 
definition of a word is here the reference model of 
its head concept type; each relation path starting 
from the head concept of this reference model is 
similar to a qualia role, in that it describes one of 
the semantic facets or possible uses of the word. 
In the context of a predicate, one of the concepts 
in the reference model is selected as the incom- 
ing point of a link from the predicate's meaning 
representation. 

The concept-oriented domain-model approach 
advocated here hypothesizes that the behaviour of 
words is driven by their conceptual roles in the do- 
main. This has the advantage of factoring knowl- 
edge at the conceptual level, rather than having 
to distribute it at the level of words. This knowl- 
edge can then be shared by several words. Sharing 



even occurs across languages {e.g. Dutch (Spyns 
and Willems, 1995D). 



Moreover, the type hierarchy allows concepts, 
hence words, to inherit reference models from 
more abstract concepts, thus enabling more shar- 
ing and modularity. The distinction between lo- 
cal information and information inherited through 
the hierarchy is furthermore exploited when rank- 
ing different chains between two concept types. 

Another difference resides in the way flexibility 
i s obtained. In Pus tejovsky's coercion mechanism 
( Pustejovsky, 1991 ), the argument's semantic type 
changes for a semantic type found in one of its 



qualia. In a variant approach ( Mineur and Buite- 



laar, 1995 ), a word has no a priori semantic type; 
it is selected at composition time among the types 
found in the qualia. In our approach, the head 
concept type associated with an argument does 
not change. The chain found between this con- 
cept and the predicate's head concept only brings 



Table 1: Conceptual representations obtained for sentences (2)-(5). 



{^^J phrase total chains method 


models 


jmrtial chains selected 




1(2)1 'angioplasty of segment IP 6063 join Angioplasty 
[Angioplasty] ^ (purported_obj ) — >■ [Artcry_Scgment] 

[Artery .Segment] (zone_of ) [SpatiaLObject] — > (spatial_rolc) — > [Segment JI] 
1 — 1 'segment IF definition 


1(3)1 'angioplasty of a coronary artery' 2387 inclusion 
JA^igioplasty] ^ (purportcd_obj ) — > [Artery .Segment] (part ) [Coronary_A.rtery] 


Angioplasty 


1(4)1 'angioplasty of Mr X' 3633 inclusion 
ifS^gioplasty] ^ (purportcd_obj ) — > [Artery .Segment] -i— (part) [Human^Bcing] 


Angioplasty 


|(5)| 'angioplasty of a stenosis' 2217 inclusion 
[Angioplasty] —>(purported_ob Artery .Segment] (involves) -t— [Stenosis] 


Angioplasty 



forward intermediate concepts and relations which 
are actualised in the presence of the predicate, and 
lead to a particular representation of their mean- 
ing. As a side-effect, this approach is able to han- 



dle sentences like (6) -(7) 



(6) John bought a long novel ( Godard and Jayez 
19931) 



(7) an angioplasty of a severe stenosis 

Since the modifier (long, severe) and the action 
(verb 'bought', noun 'angioplasty') require incom- 
patible types of the same noun (novel: event vs 
object, stenosis: state vs object), type changing 
via coercion cannot work on such sentences. This 
problem does not occur in our approach. 

Type coercion assumes that the predicate drives 
semantic composition, and that the semantic rep- 
resentation of the argument must adapt to it. In 
our method, both predicate and argument can 
make a step towards finding their semantic link. 
The resulting conceptual chain, as a whole, repre- 
sents both the specific facet of the argument which 
is involved in the sentence and the conceptual role 
it plays in the predicate. 

The preferences that grammatical relations as- 
sign to conceptual relations drive path selection, 
taking into account the specific syntactic context 
in which a semantic composition is to occur. This 
is crucial to let, e.g., prepositions, influence the 
choice of the conceptual link and the resolution of 
the metonymy. 

6 Conclusion 

The overall goal of the Menelas text understand- 
ing system was to build a normalised conceptual 
representation of the input text. The aim of se- 
mantic analysis, in this context, is to build a repre- 
sentation which conforms to a domain model. We 
therefore experimented how this domain model 
could help semantic analysis to go from the flex- 
ibility of natural language to a constrained con- 
ceptual representation, a typical problem encoun- 



tered being metonymy. The approach presented 
here shows how this can be performed. It has 
been fully implemented, and used with a reason- 
able size knowledge base as a part of the Menelas 
text understanding system. 

Metonymy processing is based on the domain 
model. Provided a new domain and task, with 
the corresponding domain model, this enables the 
generic method to adapt directly to this new do- 
main and give results that are speciflc to it. Build- 
ing such a domain model is generally feasible in 
sufficiently limited domains, typically, technical 
domains. Much of the strength of the method 
then hinges on the quality of the domain model: 
the concept type hierarchy and the attached ref- 
ercncc models must be built in a principled way 
(iBouaud et al., 1995|) . 
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